Use of Ratios and Logarithms in Statistical Regression Models

نویسنده

  • Scott S. Emerson
چکیده

In many regression models, we use logarithmic transformations of either the regression summary measure (a log link), the regression response variable (e.g., when analyzing geometric means), or one or more of the predictors. In this manuscript, I discuss the rationale for using logarithmic transformations, the interpretation of ratios, and the general properties of logarithms. 1 Use of Logarithmic Transformations Logarithmic transformations of data and/or parameters are used extensively in statistics. The fundamental reason for this stems from the following logic: 1. We are most often interested in using statistics to detect associations between two variables. 2. By an association, we mean that the distribution of one variable (we call this the “response variable”) is different in some way between groups that are homogeneous for the other variable (we call this the “predictor of interest” or POI). 3. If the two variables are associated, then that means that some aspect of the distribution (some “summary measure” like the mean, geometric mean, etc.) is unequal between two groups that differ in their value of the POI. In describing the association, we will want to describe (a) How the POI differs between the two (conceptual) groups being compared, and (b) How the summary measure of response compares across two groups. 4. There are two simple arithmetic ways to tell if two numbers (e.g., the level of POI in two separate groups, or the mean of the “response variable” in two separate groups) are unequal: (a) their difference is not 0, or (b) their ratio is not 1. 5. We choose between differences and ratios as methods for comparisons based on a variety of criteria that are sometimes competing: (a) People understand differences more than they understand ratios. i. Part of this is because natural language (e.g., English) is not very precise when describing ratios.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Bivariate and Multivariate Methods in Landslide Sustainability Mapping: A Case Study of Chelchay Watershed

1- INTRODUCTION In the last decades, due to human interventions and the effect of natural factors, the occurrence of landslide increased especially in the north of Iran, where the amount of rainfall is suitable for the landslide occurrence. In order to manage and mitigate the damages caused by landslide, the potential landslide-prone areas should be identified. In landslide susceptibili...

متن کامل

Correlation of air pollutants with land use and traffic measures in Tehran, Iran: A preliminary statistical analysis for land use regression modeling

Land use regression (LUR) models have been globally used to estimate long-term air pollution exposures. The present study aimed to analyze the association of different land use types and traffic measures with air pollutants in Tehran, Iran, as part of the future development of LUR models. Data of the particulate matter (PM10), sulfur dioxide (SO2), and nitrogen dioxide (NO2) were extracted from...

متن کامل

Applying Combined Approach of Sequential Floating Forward Selection and Support Vector Machine to Predict Financial Distress of Listed Companies in Tehran Stock Exchange Market

Objective: Nowadays, financial distress prediction is one of the most important research issues in the field of risk management that has always been interesting to banks, companies, corporations, managers and investors. The main objective of this study is to develop a high performance predictive model and to compare the results with other commonly used models in financial distress prediction M...

متن کامل

ارائۀ ساده‌ترین نسبت‌های طیفی به‌منظور تشخیص برخی خصوصیات شیمیایی خاک در مناطق خشک با استفاده از تکنیک دورسنجی (مطالعۀ موردی: کویر درۀ انجیر بافق)

Introduction Understanding the spectral reflectance of different soils and other surface elements forms the basis for analyzing the process of interpreting remote sensing data. Spectral properties of the various phenomena of the Earth's surface are not constant and are changing, based on the complex time and space conditions. Determination of soil chemical properties using remote sensing techni...

متن کامل

Factors Influencing Drug Injection History among Prisoners: A Comparison between Classification and Regression Trees and Logistic Regression Analysis

Background: Due to the importance of medical studies, researchers of this field should be familiar with various types of statistical analyses to select the most appropriate method based on the characteristics of their data sets. Classification and regression trees (CARTs) can be as complementary to regression models. We compared the performance of a logistic regression model and a CART in predi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014